ON THE LOCAL Tb THEOREM: A DIRECT PROOF UNDER DUALITY 

ASSUMPTION 



MICHAEL T. LACEY AND ANTTI V. VAHAKANGAS 

Abstract. We give a direct proof of the local Tb Theorem, in the Euclidean setting, and 
under the assumption of dual exponents. This Theorem provides a flexible framework for proving 
the boundedness of a Calderon-Zygmund operator, supposing the existence of systems of local 
accretive functions. We assume that the integrability exponents on these systems of functions 
are of the form 1/p + 1/q < 1, and provide a direct proof. The principal point of interest is 
in the use of random grids and the corresponding construction of the corona. We also utilize 
certain twisted martingale transform inequalities. 
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1. Introduction 

Our subject is the local Tb theorem, of which there are many variants. These theorems extend 
the David-Journe T1 Theorem [8], and the Tb Theorem of Christ [6] by giving very flexible sets 
of conditions under which an operator T with a Calderon-Zygmund kernel extends to a bounded 
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linear operator on L 2 . The lectures of Steve Hofmann [12] indicate the range of interests in this 
type of questions. The goal of this paper is to give a new direct proof of this result, one that 
applies when the integrability conditions imposed in the hypotheses are those of duality. We do 
this in the setting of Lebesgue measure. A companion paper [19] addresses a perfect dyadic 
variant of this same result; it contains many of the same features of this argument, with (many) 
fewer technicalities. 

We consider an operator T given in a bilinear form by 



(Tf , g) 



R n 



K(x,y)f(y)g(x) dy dx 



whenever f and g are two C°° compactly supported functions, whose supports are disjoint. The 
kernel K(x,y) satisfies these estimates for some r\ > 0: 



|K(x,-y)| < \x-y\ 



|K(x,-y) - K(x',y)| + |K(y,x) - K(y,x')| < 



|n+T| 



X 



< 



■y\ 



Define T to be the norm of T as an operator on L 2 to itself. If this operator norm is finite, we 
say that T is a Calderon-Zygmund operator. 

1.1. Definition. Fix 1 < p < oo. A collection of functions {bq : Q C R n is a cube} is called a 
system of p- accretive functions with constant 1 < A if these conditions hold for each cube Q. 

(1) bq is supported on Q and /qb^^x) dx = |Q|. 

(2) ||b Q || p < A|Q|Vp. 

The following is our main result. 

1.2. Theorem. Suppose that T is a Calderon-Zygmund operator and, for 1 < pi,p2 < oo, with 
1 /pi + 1 /p2 < 1, there are systems {bq} of r p P - accretive functions with constant A, so that for 
a finite constant T\ oc and all cubes Q, 



rn>ii^<T&Q|, 



|T*b 2 r' <T*i\Q\. 



Then, we have a quantitative estimate T <n,T],v uV2 ,A 1 + T| oc for the operator norm of J. 



This, and more has been proved in the perfect dyadic case [1]. It was the hope that the proof 
technique of that paper could be lifted to the continuous case. In some measure that turned out 
to be true, but some of the direct methods of proof [10,15] require assumptions that are stronger 
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than the duality assumption. Theorem 1.2 in duality is known, and a corollary to Auscher-Yang 
[3], who provide a reduction to the perfect dyadic case. 

More recently, Auscher-Routin [2, Theorem 3.5] have provided a direct proof in the setting 
of homogenous spaces, following an adaptation of the so-called Beylkin-Coifman-Rokhlin (BCR) 
algorithm, see [4,9], as well as the martingale transform inequalities. 

A comprehensive solution to Hofmann's problem has very recently been obtain by T. Hytonen 
and F. Nazarov, [16]. By applying perturbation techniques for both the operator and the accretive 
functions, they obtain a variant of Theorem 1.2 for all exponents pi,p2 G (1,oo). 

We provide an alternate direct proof of Theorem 1.2 in the case of duality, somewhat different 
in flavor. It is desirable to have such proofs from the viewpoint of extensions of the argument to 
other settings, such as the non-homogeneous setting [21], and those of elliptic PDEs [11]. 

Let us turn to a discussion of the proof technique. As is very common, absorbtion parameters 
enter into the proof at several stages, permitting us to resort to the assumed finite but non- 
quantitative norm bound on T, provided it is multiplied by a small absorption parameter. We use 
the well-known non-homogeneous techniques of [20], in particular the powerful technique of good 
and bad cubes. In the local Tb setting, there is however a delicate problem with the typical method 
of insertion of the good and bad cubes as is pointed out by Hytonen-Martikainen [15, Remark 
4.1]. An important innovation of this paper is a corona construction to insert goodness in a 
natural way. 

As is well known [1,2,11], by a local T1 theorem, we only need to consider the bilinear form 
(Tf|,f2), where |f|| = \iz\ = Iqo, for a fixed cube Q°. One works with two dyadic grids, D 1 
and V 2 , which are random and defined on independent probability spaces D) , ) = 1,2. A cube 
Q G D 1 has a property of being bad, in that it is close to the boundary of some significantly 
larger cube in the other grid V 2 . This property is an event in D 2 , that has probability that can be 
made arbitrarily small, giving rise to an absorption parameter. Otherwise the cube is good. One 
projects fi onto the good cubes, calling the resulting projection f-\. This makes an error that is 
small in all L p spaces on average, although the projection is now a function of O 1 and D 2 . This is 
the first of several absorbtion arguments. One then makes a standard selection of stopping cubes 
5' C V\ and local b functions b' s for S E SK The stopping cubes 5' are sparse, in the sense 
that the collection is a Carleson sequence of cubes. 

A point of departure is that one constructs functions (3 ^ which are the selected b\ projected 
away from those bad cubes which themselves have S as parent in S ] . The reason for doing so is, 
that the twisted martingale difference of fi, with respect to fi\ and over a bad cube Q with 5 1 
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parent S, will (typically) be zero — a very desirable feature. Critically, the resulting function |3g is 
also a function of Q 1 and D 2 , and in addition, the original collection of stopping cubes 5 1 is not 
so well adapted to the On the other hand, favorably to us, these functions can be viewed as 
small perturbations of b\, in the sense of probability. 

In order to adopt the usage of perturbed functions in twisted martingale differences, one cannot 
run the stopping cube selection process again, due to the unacceptable dependices on H 1 and 
D 2 . Instead, one invokes absorbtion, arguing that one can truncate the stopping tree 5 1 inside 
a set B 1 that is small on average. This completes the corona construction. The details of this 
argument take up §2, which is almost half the length of this paper. 

Turning to the remaining part of the argument, one is in a familar situation [20] where only 
good cubes P G P' and Q G V 2 need to be considered. There are additional tools that are 
quite useful, namely certain martingale transform results for twisted martingale differences, and 
associated half-twisted inequalities that are universal, in that they hold in all L q -spaces. These 
inequalities also play a crucial role in [2, Lemma 5.3]; also see the companion paper [19] that 
serves as a model for this one. The double sum over P, Q is reduced, by symmetry, to the case 
of £P > £Q, and this sum is further decomposed into subcases according to the position and size 
of Q relative to P. The case of Q deeply inside P admits a very direct control, by appeal to 
the twisted martingale transform inequalities. This is the essential paraproduct term. (Experts 
should note that we do not appeal to Carleson measure arugments at any stage of the argument; 
in this we follow [2, 17, 18].) A potentially troublesome case is when Q C 3P \ P. If Q is 
substantially smaller than P, then it is still relatively far from the boundary of P, due to goodness. 
We address this case by exploiting the smoothness condition on the kernel K, and the universal 
half-twisted inequalities. The case of P and Q having the same approximate size and position 
requires new perturbation inequalities for the twisted martingale transforms, also a new argument. 
This 'diagonal' case is the hardest one in many existing arguments, including ours. The remaining 
cases depend upon standard off-diagonal estimates for singular integrals, and universal martingale 
transform inequalities. 



Notation For a cube Q, (f)g := |Q| _1 J Q f dx, and £Q = \Q\ ]/n is the side length of the cube. 
A < B means that A < C • B, where C is an unspecified constant independent of A and B. The 
distances in R n are measured in terms of the supremum norm, |x| =11x1100 for x G R n . 
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2. The Corona 
It is a consequence of the local T1 theorem, see e.g. [1,2,11], that 

T<1+ sup |Q|- 1 ||1 Q T*1 Q || L ,+ sup | Q | 1 ||1 Q T1 Q || L , . 

QcR n cube QcR n cube 

Without loss of generality, we can assume that the last term dominates. Fix a cube Q° for which 
(2.1) T|Q°|<||l Q oTl Q o|| Ll . 

Then, for notational convenience, take two functions fi,f 2 with |f|| = |f 2 | = Iqo, so that 
||1qoT1 q o|| l i = (Tfi,f 2 ) . 
The point of the section is to devise a corona (decomposition) which proves this. 

2.2. Lemma. For every < Vo < 1, there are functions f-\ and f 2 , and a constant < C < oo 
independent of both T and T| oc , such that these conditions hold. 

||f J -f J ||2<-uolQT /2 » j = 1 > 2 , 

(2.3) |(Tf b f 2 )| < {C(1 +T, oc ) +u T}|Q°| . 

This lemma and an absorption argument complete the proof of Theorem 1.2. The construction 
of the corona, which has properties that will allow the straight forward proof of (2.3), is itself 
hardly straight forward. It will be highly dependent upon certain random constructions, and there 
will be several absorption parameters that lead to the constant Vo above. A particular obstruction, 
one that motivates our complicated corona construction, is the identification of the correct way 
to insert the goodness of cubes. 

2.1. Random Grids. We use the foundational tool of random grids, as initiated by Nazarov- 
Treil-Volberg [21], which has in turn been used repeatedly. See [13,14,18,23], to give just four 
citations touching on different topics. The properties we need are well-known, and when we 
appeal to them, are clearly identified. Throughout, we will have a random dyadic grid "D 1 for the 
functions fi,fi and a random dyadic grid V 2 for functions f 2 , f 2 . 

The random grids are constructed this way. Let T>° be the standard dyadic grid in R n . For a 
fixed cube Q 6 V°, the position of a translated cube is 



Q = QW:=Q + Y_ 2 

j : 2-i<£Q 
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which is a function of tu 1 G O 1 := ({0,1 } n ) . Denote by D 1 the collection of these translated 
cubes. The natural uniform probability measure P 1 is placed upon O 1 . That is, each component 
cu-, j G Z, has an equal probability 2~ n of taking any of the 2 n values, and all components are 
independent of each other. The expectation with respect to P 1 is denoted by E . Define D 1 in 
the same manner, with an independent copy of O 1 . It will be important to distinguish between 
these two copies, so we write cu' G D) for the elements of the probability space that define V ] . 
The product P 1 <g> P 2 is denoted by P, and the corresponding expectation E1E2 is denoted by E. 

As our construction has several parts, we will often simply write Q for a cube in T>\ instead of 
the heavier notation Q+cu' with Q G V°. Those parts of the argument that depend upon the 
construction of the random grids will be clearly identified. 

We also need some notation. Given Q G T>\ we denote by ch(Q) the 2 n dyadic children of Q. 
Given S C T>\ we can also refer to chs(S) the 5-children of S G S: The maximal elements S' of 
S that are strictly contained in S. For a cube Q G T>\ that is contained in a cube in S, we take 
7tsQ to be the 5-parent of Q: The minimal element of S that contains Q. Set, once and for all, 
the familiar and convienent choice [13,15,21] 



2(T] + n) * 

Throughout r > 3/e should be thought of as a large integer, which satisfies probability condition 
(4) below, and whose exact value is assigned later. 
We say that a cube Q G V ] is bad, if there is P G V 1 such that £(P) > 2 r £(Q) and 

dist(Q,3P) < [lQ) £ {i?y- £ . 

Otherwise, Q is good. The definitions for Q G V 1 are similar. These properties are well-known 
for a cube Q G f 1 . 

(1) The goodness/badness of Q is a random variable on D 2 ; 

(2) The position and goodness/badness of Q are independent random variables — indeed, the 
former is a function of tu 1 GO 1 ; 

(3) The probability 7t good := P 2 (Q is good) is independent of Q; 

(4) 7T ba d := 1 — Ttgood ^ 2~ eT , provided er is sufficiently large. 

Define the good and bad projections by I = P good + Pb ad , where 
P g ood4>:= D Q 4>, j = 1,2. 

QeX>J : Q is good 
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Here Dq4> = X.Q'ech(Q){( ( t ) )Q' — (4 ) )q}1q / is the usual martingale difference associated with Q. 
We have the following proposition on the norms of the bad projections. 

2.4. Proposition. For 1 < q < oo anc/{j,k} = {1,2}, there is a constant c q > so that 

E k ||PLd*H3< 2- Cq£T ||*||^. 
Here tu' is fixed, and $ G L q is any function that is independent of sequences w k G Cl k . 

We appeal to this a few times below, and the constant < c q < 1 that appears in the exponent 
on the right will be a function of pi and p2. We will supress this dependence in notation, writing 
only 2- cer . 

Proof. The basic idea is to apply Marcinkiewicz interpolation theorem to the linear operator 
P bad : L q (dx) — > L q (P k (g> dx). The projection to bad cubes is a martingale transform [5], hence 
the following inequality with no decay holds, 

E k ||PLd4>||£ < sup{||PL4>||£ : cu k G O k } < ||ct>||p , K p < oo . 
Thus, it suffices to verify the claimed decay for q = 2. To this end, we have 

E k ||P Dad 4^=E k y_ \\u Q m 

Qev> 

Q is bad 

= 7T bad ^||D Q ct)||^=7t bad ||ct)||^ 

Qex»i 

Indeed, the position of Q G £>' is determined by cu' G Q? which is fixed, and badness of Q is a 
random variable on Q k , {j, k} = {1 , 2}. □ 

2.2. Selection of fj. We will prove Lemma 2.2 by averaging over random grids. For j G {1,2}, 
let A\ denote all (at most) 2 n cubes Q G £>' such that Q n Q° * and IQ° < £Q < 2£Q°. 
Let Ai be all cubes in V ] contained in some Q G A{. Recall that functions fj are chosen in 
connection with (2.1), and they are equal to Iqo in absolute value. We define the approximates 
of these two functions to be 

f,:= £(fj) Q l Q + DQfj. 

Q is good 

In view of Proposition 2.4, we have 
(2.5) E||f j -f j ||^<2- ceT |Q°|. 
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Hence, it suffices to estimate E|(Tf 1 ,f 2 )|. 

The functions fj lie in BMO: — a dyadic variant associated with the grid £>'. It follows from 
the corresponding John-Nirenberg inequality that 

(2.6) Hfjllq < |Q°| 1/q , 1<q<oo, 

with the implied constant independent of sequences cu 1 and cu 2 . The fact that the functions fj 
can nevertheless be unbounded creates a minor set of difficulties for us. 

2.3. The Stopping Cubes Construction. In order to accomodate the good-bad decomposition, 
we will need a significant modification of the usual selection process of stopping trees and local 
b functions. Some definitions will help explain the end result that we are after. For the following 
definition, it is convenient to denote T 1 = T and T 2 = T*. 

2.7. Definition. Constants < t, 5 < 1 are fixed, and {j,k} = {1,2}. A collection of integrable 
functions {|3' s : S G 5' C £>'} is stopping data (perturbed stopping data) for a collection £/' C £>' 
of cubes if the following conditions hold with Aj = 1/2, Bj = A v > , and Cj = 5 _1 Tf k c . (in 
case of perturbed stopping data: Aj = 1/4, Bj < A v \ and Cj < 8~ ] T^ c +v v ^T v t for some 
constant < v-\ < 1 .) 

(1) Every Q G is contained in some S G SK The same holds for every child Q' G ch(Q), 
but the parent 7T 5 jQ' need not equal 7t 5 jQ, even if Q is a minimal cube in QK 

(2) If Q G Q ] with n S j Q = S (or Q G ch(R) with R G and n si Q = S), then 

(a) ((3' s )q > Aj. (Don't divide by zero.) 

(b) (\M^) Q < Bj. (Local norm of |3' s controlled.) 

(c) (IT^sI^q < Cj. (Local norm of T'(3' s is controlled.) 

(3) IZs'Gch j(s)l^'l — T l^l ^ or a " ^ e (5' is a sparse collection of cubes.) 
For Q G Q\ we define the twisted martingale difference by 

Q'ech(Q) ^ \P« 5 jQ'/Q' VPtt^q/Q ) 

This is well defined, as Q has an <S> parent, and there is no division by zero, conditions (1) and 
(2a). We also define the half-twisted martingale difference by 

(2.9) Dg4>:={ Y. -T^Q'j T^lQ- 
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Observe that here we do not multiply by a |3' function, and the sum over the children excludes 
those with a different <S' parent (this difference would force a change in the |3' function). 

The following Lemma is a key tool for eliminating the martingale differences that are associated 
with bad cubes. 

2.10. Lemma. Suppose A > 1 and < v-\ < 4~ 1 ~ n . Then, there are collections Q ] C T>i of 
cubes, j = 1,2, and perturbed stopping data {|3 S : S G S'} for such that conditions (l)-(4) 
below are satisfied. 

(1) Every cube Q G is good. 

(2) ForallQegi, (|fj|) Q < A. 

(3) For all Q G with child Q', and S G 5' with 7%jQ C S, define a constant Aq> by 



(2.11) Aq/Iq' := 1q/ Df j fj 



n si P=S 



Then, |A Q /| < A. 
(4) Assuming A -1 + Avf 1 • l' " < 1 , there holds 

E (Tf b f 2 ) - Y_ Y- (TA^f b A^f 2 ) 
(2.12) Peg 1 qeg 1 

< Ci{1 +T Ioc + (d 1 +A- 1 +AV -2- ceT )T}|Q°|. 
Here Ci = Ci (pi,p2, n, A) does not depend upon V], A, r. 

It is clear that < i>i < 1, and A> 1, and r ^> 1 are absorbtion parameters. 
The implication of (4) above is that, in order to complete the proof of Lemma 2.2, it suffices 
to verify the following inequality, 



(2.13) 



Y_ Y_ (TAjff^Aj^fz) < {C 2 {1 +T loc }+ C s rviA 2 T}\Q c 

Peg 1 Qeg 2 



We emphasize that inequality (2.13) is uniform in cu 1 and tu 2 , and that it is distinct from (2.12). 
The constant C3 = C3(pi, p 2 , Ti,r|, A), that is independent of absorption parameters, and the 
product TV] A 2 of absorption parameters appear on the right. The constant 

C 2 = C 2 (pi,p 2 ,n,Ti, A,r, A,-ut) 

is allowed to depend also upon the absorption parameters. 
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Returning to the proof of Lemma 2.2, let us consider inequalities (2.5), (2.12), and (2.13). By 
taking A > 1 sufficiently large, and then choosing r large enough and assigning ui = r~ 2 , we 
complete the proof of the aforementioned Lemma. 

The proof of (2.13), taken up in the next section, has several advantages for us. 

(1) We need only consider good cubes, which is the primary goal of the construction. 

(2) By normalizing both fi and fz with a factor A -1 , the sums of the half-twisted differences 
are controlled by c < 1 across the collections of cubes, which is related to the well-known 
telescoping property needed in the control of paraproduct terms. This normalization is 
assumed in the beginning of §3, and thereafter. 

(3) The argument to control the sum in (2.13) is now largely standard in nature, following 
the lines of [20,24], and including some inovations from [17,18], to avoid construction 
of auxilary Carleson measure estimates. However, certain perturbation inequalities are 
needed when treating cubes that are nearby, both in size and position. 

2.14. Remark. Hytonen and Martikainen [15, Remark 4.1] have pointed to serious concerns with 
some existing approaches to the insertion of goodness in local Tb theorems. The substance of 
the problem arises from the fact that the twisted martingale differences depend upon the choice 
of grid, and the collection of local b functions, making averaging arguments — such as the one 
used in the proof of Proposition 2.4 — not transparently true. Our goal is to insert goodness in a 
transparent manner, and this is one of the main contributions of the present paper. 

2.15. Remark. The dependence of the quantitative estimates on the parameters aside from T 
and T| oc is not straight forward, and generally speaking we do not track it. However, we need 
to track the dependence of a constant c on absorption parameters r, A, v-\, if it appears in an 
expression c • T. Accordingly, the symbol '<' should be read as 'the unspecified implied constant 
that depends upon parameters there is no need to track'. 

The remainder of this section is taken up with the proof of Lemma 2.10. We begin with a 
construction of auxiliary stopping data {b ] s : S E 5'} for the collection A\ which we defined 
when we selected fj. The claimed perturbed stopping data will be later constructed by using this 
auxiliary stopping data as a starting point. 

Initialize «S' to be A{. For each cube S in this collection, consider the function b' s given to us 
by the local Tb hypothesis, see the formulation of Theorem 1.2. Add to 5' the maximal dyadic 
descendants Q C S which either fail any one of the criteria (a) — (c) in Definition 2.7, with 
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Ps := H> or ^ a '' t ' ie condition 
(2.16) inf M|b ] s | p s (x) < 6 _1 A p i . 

xeQ 

Concerning these stopping conditions, let E s be the union of maximal descendents Q of S such 
that (1?s)q < I. We have, using the higher integrability of b' s , 



b 3 s dx = 


b 5 s dx + 


. s 





b J s dx 

S\E S 

< 1|S| + A|S \ Esl^WlSI 1 ^ . 

Hence, (2A)~ P ) |S| < |S \Es|. Next, let us consider the union ¥$ of maximal descendants Q of S, 
failing (2.16) or any one of the mentioned criteria (b) — (c). By inspection, |F$| ^ S J S [ . Therefore, 
with choice of 5 = 5(pj,n, A), we can continue the construction of & inductively to meet 
conditions (1) — (2) and also the sparsness condition (3) in Definition 2.7 with x = x(pj,n, A). 
This construction of <S' and {V s : S E &} is fairly standard, and we emphasize that it only 
depends upon uA Below, we will refer to S\ and its subsets, as collections of stopping cubes. 

One has the familiar representation below, of which a variant will appear later. See for instance 
[7, Lemma 3.1], [20, Lemma 4.1], or the proof of [15, Lemma 3.5]. 

2.17. Lemma. The following equality holds almost everywhere, and in L p \ j = 1,2. 

f,(x) = X.< f J>sH(*)+Z Ag'f^x). 

The last series is interpreted as limi c ._ >00 Y-QeAi-iQ>2~^ ^Q^r 

However, we would end up with certain difficulties if we applied this representation formula. In 
order to explain, let us first observe that y' 2 < Pi because 1/pi + 1 /p2 < 1. Hence, any series 
converging in L P1 converges locally in L p 2. Since U^4* is compact, this allows us to write 

(Tf b f 2 ) = Y_ ( f i)R("^R> f 2) + Y. < f 2>s(fi>T*b 2 s > 

ReAl SeAl 

"II (fi>R<f2>s<Tbl,b|> (TAp 1 ^, Ag 2 f 2 ) . 

The last series is interpreted as the limit below, 

lim lim Y Y (TA^fi, Aq f 2 ) . 

ki— >ook2— >oo * — * — ^ 

Pe-A 1 QeA 2 

£P>2~ k 1 £Q>2~ k 2 
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The focus is on this double sum. Its direct analysis contains difficulties due to the presence of 
martingale differences that are associated with bad cubes. This forces us to modify Lemma 2.17 
slightly, and the modification allows us to restrict the core summation to martingale differences 
that are associated only with good cubes. 

Once the corona construction is complete, we will assume that we have a finite sum, such as the 
ones above. Various rearrangments of the finite sums are made, and bounds will be independent 
of the finite truncations of the sums. 



2.4. Perturbation of the b functions. In a departure from standard arguments, we modify the 
b' s functions already selected, S E SK This is an important step in the reduction to a summation 
involving only good cubes. For S 6 5', we define 



(2.18) |3> 



j ._ 



b> s -|3>, where |3> := 



D Q b' s 



Q£A' : 7t~j Q=S 
Q is bad 



Note how the sum defining |3' s is formed. It is only over classical martingale differences associated 
with bad cubes in A} which have the same stopping parent. Particular care must be taken with 
these perturbations |3' s , as they are now functions of both tu 1 and cu 2 . Here, we show that |3' s is 
a small function, on average. 

2.19. Lemma. For {j, k} = {1 , 2} and all S e S\ there holds 

II PsIIbmo ~ ^ ' 

E k ||f3^<2- cei -|S|, 1<q<oo. 
Proof. Let Q G X>> be such that 7ig-Q = S. Writing ecy := Iq/cqI^.q/^sIq/ is bad , we obtain 



VPi 



H3's-(3's) Q l Pi dx 



Pi 



Q'CQ 

Y_ e Q TJ> Q ,(l Q bi 

Q'exw 



Pi 



^ IIIqHIIp, < <|Mbi|^>Q Pj IQI 1/Pj ^ IQI 



|VPj 



Here, we have appealed to the boundedness of martingale transforms, and the stopping rules. 
The remaining cases either reduce to this, or are trivial. Hence the BMO assertion is true. 
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Concerning the L q estimate, we apply Proposition 2.4 and the John-Nirenberg inequality, 

q 



E k |||3 ] s ||3=E k 
By arguing as above, we finish the proof. 



bad 



< T 



< 2- C£T \S\ 



q 

BMO 



□ 



2.5. Truncation of the Stopping Tree. We will use the functions |3' s as the basis of perturbed 
stopping data, but the path to this is not yet clear for these reasons: (A) functions fi\ are not 
suitable to form the twisted martingale differences; (B) we want an additional property related to 
goodness of these differences; and (C) the functions fj are unbounded. 

Concerning point (B), one should not fail to note that AqI need not be zero; nevertheless, 
there is a simple sufficient condition for a twisted martingale difference to be identically zero. 

2.20. Proposition. Assume that Q £ J\) is bad, and no child of Q is in SK Suppose { f3g ) q ± 0, 
where S = Tt^Q. Then, both Aqfj and Dgfj are well defined using <S' in parent selectors for |3 j 
functions, and A§fj =0 = 5^ 



q 'y 



Proof. By assumptions and definitions, the averages of fj and |3' s do not change moving from 
cube Q to a child of Q. By inspection of (2.8) and (2.9), the ratios in the definition of either 
martingale difference of f j are all well defined and equal, hence cancel. □ 

The previous considerations lead to the following three types of undesirable cubes Q £ A\ 
where A > 1 and < v-\ < 4~ 1 ~ n are absorption parameters and {j,k} = {1,2}. 

Type A: {(IMP* o |?J) > or (|"P& \<) Q > v v ^T< } or Q has a child S £ <S' 

such that {(|Mp ] s |Pi)s > ^ or (|P^ S |^) S > V V ^T^}. 
Type B: Q is not of Type A and Q has a child in <S\ and Q is bad. 
Type C: Q is not of Type A, nor Type B, and (|fj|)Q > A. 

Each of these three types depend upon both cu 1 and w 2 . Let £>'' a be the collection of maximal 
cubes in A 1 of Type cx, oc = A, B, C, and let £>' be the maximal cubes in the union of these three 
collections. Define B'' a := U{Q : Q e B^}, and B' := B'' A U B^ B U B'' c . 

It is tempting to restart the selection of the b functions for those cubes Q £ &, but this would 
introduce opaque interdependicies among the selected functions, since & is a function of both 
cu 1 and cu 2 . Instead, what we do below is truncate the stopping tree & at the collection £>'. This 
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will be an admissible step, as the sets B> are small in measure, on average, so that the resulting 
error terms can be absorbed. 

We can now define the collections Q\ and the perturbed stopping data claimed by Lemma 2.10. 
Take to be all good cubes in A\ not contained in any cube in £>'. Set 5' to be 5' less any 
cube strictly contained in some Q G BK For convenience, we also denote by W D all cubes 
in A\ both good and bad, not contained in any cube in £>'. We emphasize the fact that QeW 
is not of any Type oc, <x = A, B, C. 

Take the data for to be {|3' s : S G 5'}. Next we will show that this is actually perturbed 
stopping data, as claimed. By construction, 

(2.21) n s .Q = n §j Q } n s .Q' = n §i Q' 

if Q G W and Q' G ch(Q). Accordingly {$\ : S G S ] ) meets property (1) in Definition 2.7 
of perturbed stopping data. Another useful consequence of (2.21) is that we can compute the 
martingale differences Ap' and Dp' in case of P G TV by using freely either <S>' or & in the parent 
selectors for |3' functions. 

The sparseness property (3) is trivial for 5' since 5' satisfies it and 5' C <SK The remaining 
properties (2a) — (2c) of perturbed stopping data follow from this Lemma. It also shows that the 
set B'' A has small measure — on average. 

2.22. Lemma. Fix a cube S G 5'. Then, there holds (1) — (3) below. 

(1) <lH>s = 1. 

(2) (|S' s |Pi)s<AP i . 

(3) Let Q G W and n S ) Q = S (or Q is a child of a cube in TV and n S ) Q = S). Then, 



(a) (lH> Q > 



(b) (|M|3 ] s |p0q ^ 6 _1 ATi. 

(c) (\V& s \*) Q *b-iT& + v*T&, 

where {) , k} = {1 , 2}. Finally, 



(2.23) E k |B'' A | £ V'2" cer |Q°| . 

Proof. It is immediate from Definition (2.18) that |S| = J s b' s dx = J s S' s dx, so claim (1) 
follows. By the boundedness of martingale transforms, claim (2) is also clear: 



IP'sl Pi dx< 



|b' s | p i dx< A p i|S| 
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The properties in (3a) — (3c) are a consequence of equation (2.21) and the failure of condition 
defining Type A cubes. Indeed, if Q G TV and 7r 5 jQ = S, 

mQ>H)Q-(mi\ Vi )Q Vi >\-^, 

which is greater than 1 /4. (Recall that stopping data is slightly stronger on this point.) If Q is a 
child of a cube in TV and 7t 5 jQ = S, then either Q 6 S\ in which case (|3' s )q = (|3q)q = 1 , or 
the estimate follows as above by first comparing the average of \$\\ on Q to its average on nQ. 
Let us then consider (3b) and (3c) for Q e TV . By sub-linearity and stopping rules, 

(|M|3 ] s Hq £ (|Mb'sHQ + (WsHq < 5- 1 A^ + v v j < S^ATJ . 
Likewise, 

(|T j |3 J s |^) Q < (\Vb\\<) Q + (\V$ S \*) Q < 5- 1 Tfi +v$T* . 

The claims for a child Q of a cube in TV follow by comparing the average on Q to that on nQ, 
in case of Q $ S\ and from the stopping rules in case of Q e S\ 

Turning to inequality (2.23), i.e. showing that the set B^' A has small expected measure, recall 
that the collection & is only a function of uA Moreover, by sparsness of this collection, 

}JS|<-L-|Q |<|Q |. 

^ I — T 

sesi 

A cube is of Type A for four potential reasons. Fix S G 5', and let £>^' A ' be the maximal cubes 
QeA ] with n$Q = S, and (\M^{\^) Q > u p \ By Lemma 2.19, 

E k Y_ IQI ^ v7 Pj E k |P 5 s |Ti <i>7 Pi 2- cer |S| . 

Second, let B^ Al be the maximal cubes Q e A] with n^Q = S, and (|T^ 5 S |^) Q > T<v^. 
Then, using the a priori norm bound cT for the operator V on L p k and inequality p{ < Pj, 

E k Y_ IQ|sV*-E k l(3sl Pk <-u7 Pi 2 _cer |S| . 
QeB^ 

Third, let 23 j ' A3 be the collection of cubes Q in A\ having a child S G 5' with (|M.j3' s | p >)s > u Pi . 
Then, 

E k Y_ IQI^i>7 Pi ^E k IPs!* <-u7 Pi2_C£T }l|S| < u7 Pi 2- cer |Q°| . 
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A similar estimate for the remaining collection B^ Aa of cubes Q in A\ having a child S 6 5' such 
that (|T j j3' s | p k)s > u^T^, finishes the proof of Lemma. □ 

Let us check conditions (1) — (3) of Lemma 2.10. Every cube Q G is good by definition and, 
by construction, (|f j [) q < A. Let us consider the property (3), concerning the sum of half-twisted 
differences in (2.11). For all Q E G ] with child Q', and S G S ] with 7t 5i Q C S, there holds 
|Aq/| < A, where Aq/ is the constant defined by 

Aq/1q/ := 1q/ ^ D^'fj . 

?eA> : PDQ 

7T sj P=S 

In contrast to the sum in (2.11), the sum above extends over all cubes with the same 5' parent. 
Hence, by inspection of (2.9), the sum above is telescoping to the difference of two ratios (or 
to a single ratio). On the numerator of the ratios, are averages of fj, which are bounded by the 
definition of Type C. The denominator of the ratios is an average of |3' s , which is bounded below 
by 1/4 because of condition (3a) in Lemma 2.22. Hence, [ Aq/ | < A. Finally, let P be a bad cube 
in the sum above, then P G TV and it has no stopping children in 5' by construction. By property 
(2.21) and Proposition 2.20, we find that Sjffj = 0. Thus, (2.11) holds. 

2.6. Completion of the proof of Lemma 2.10. The remainder of the proof of Lemma 2.10 is 
taken up with the proof of (2.12). This inequality implies that it suffices to bound the sum over 
the collection Q ] x Q 2 , compare to inequality (2.13). Let us begin with probability statements 
involving smallness of sets B'. 

We have seen that B'' A has small measure on expectation. The same fact for B' ,B is straight 
forward. Indeed, the collection 5' is only a function of w\ and holding that variable fixed, the 
event that S G has a bad parent is an event in O k , where {j,k} = {1,2}. And so, 

(2.24) E k |B'< B | < 2 n • E k JjS^s is bad < y^-^Q ! < 2" eT |Q°| . 

Se5i 

For the remaining set B'' c , recall that fj is a dyadic BMO function, uniformly over cu 1 and cu 2 . 
More precisely, by Chebyshev's inequality and (2.6), we have 

(2.25) |B'' C | = Y_ IQI ^ K Mf i > A H ^ A ~ 2Pi ll Mf jll2pj ^ A- 2p i|Q°| . 
We have verified that 

(2.26) E|B j | =E j E k |B j | < {A" 2 * + v; Pj 2- C£T } |Q°| . 
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Indeed, this follows by combining inequalities (2.23), (2.24), and (2.25). 

We turn to a variant of Lemma 2.17, adapted to our construction. Since we have truncated 
our stopping tree, the statement is not as clean. We need to define 4>' := 2^Qesi 4>q, where 
4>' Q = fjl Q if Q e £>' n A\ and, otherwise, 

(fj>Q 



fjlQ "(S^^ iQlQ - 



We claim that 



(2.27) £A*>|B'|. 

Recall that is a function of (tu\uj 2 ) and is of small measure in expectation. Let us prove 
inequality (2.27). If Q e & n A{ then, by (2.6), H^'qH^ <|Q°| 1/Pi < IB'I 1 ^. There are at most 
2 n such cubes. For the remaining terms, we notice that as f, is BMO, and the average values of 
f; are controlled, 



qeBi\A{ 



L 

Q 



<fj>Q 



P 



(2.28) 



IfjlQ 



(^ 5j Q)Q 

-(f,>QlQ||g+ (fj>Ql Q 



Q 

<A p s Y_ |Q| < A p i|B j | . 

QeBi\Al 



<fj>Q 



(PL,q)q 



PL 



We used definition of Type C cubes and Lemma 2.22, along with the observation that the parent 
of Q is in TV if Q e £>' \A{. This concludes the proof of inequality (2.27). 
Concerning the representation of fj, we have 

2.29. Lemma. The following equality holds almost everywhere and in L v ' . 

(2.30) f,(x)= Y. ( f j>QpQM + L A^(x) + ^(x). 

Proof. Let Q be any bad cube, which is not contained in a cube in £>'. By construction and 
Proposition 2.20, Aqfj = 0. It follows that for any x G Bj, the sum above is in fact finite, and 
telescoping. By inspection, it is equal to fj(x). 

Consider x £ Bj, then note that for any cube P 3 x, 



(fj> 



i/p 



QeAi\Bi 



Qeg> : PgQ 



P 
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Now, since <S' is sparse, almost every x is in only a finite number of cubes S G SK Hence, for 
all P sufficiently small, as a function of S, 7t 5 jP = S does not change. And, by the martingale 
convergence theorem, for almost every x G S, 

(|3' s )p -» |3' s (x) and (fj) P -> fj(x) . 

The convergence is as £P — > 0, with x G P. The convergence is in L p >. Moreover, by the stopping 
rules, the averages of |3' s are bounded from above and below, so that L p ) convergence also holds 
for the inverses as well. □ 

We can conclude the proof of Lemma 2.10 this way. It remains to verify (2.12). In expanding 
(Tfi,f2) using (2.30), there are a number of terms. Note that we have these estimates, and their 
duals, as applicable, which we do not directly state. For P G A\\B\ the cubes P and Q° are 
roughly of the same size, so that |(fi)p| < 1 by inequality (2.6). Furthermore, using the local Tb 
hypothesis, definition of Type A cubes, and the Hardy inequality stated in Lemma 3.17, 

|(Tf3 P ,f 2 )| < |(T(3 P ,f 2 lp)| + |(T(3p,f 2 l 6 p\p)| <{1 +T, oc + v 1 T}|Q°|. 

And for V E A\\B\ and Qe^\ B 2 , by the same reasoning, we have 

|(Tf3 P ,|3 2 Q )|^{l+T, 0C +-u 1 T}|Q°|. 

Next, for P as above, there holds by the assumed norm inequality on T, and inequalities (2.27) 
and (2.26), 

E{ | (T|3 P , 4> 2 > | + | (Teh 1 , f 2 ) | } < T ■ {A- 1 + AV • 2- cer } • | Q°| . 
Lastly, when A -1 + Av^2~ C£T < 1 , we have 

E| (Tcf) 1 , 4> 2 )| < T- {A" 1 + Ad7 1 ■ 2-°"} ■ | Q° | . 
Collectively, combined with (2.6), these inequalities complete the proof of (2.12). 

3. Useful Inequalities 

3.1. The Martingale Transform Inequalities. We recall essential tools that we will need. Fix 
a function b supported on a dyadic 1 cube So, satisfying Jb dx = |So|, and ||b|| p < B|So| 1//p , 
where 1 < p < oo is fixed. We will consider a fixed but arbitrary collection T of disjoint dyadic 

1 In our applications, the underlying dyadic system will be V\ ) G {1,2}. 
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cubes in So, the 'terminal cubes'. Let Q be all dyadic cubes, contained in So, but not contained 
in any terminal cube T G T. We require that there is cr G (0, 1 ) such that, for all Q G Q, 



(3.1) 



b dx 



>4 _1 |Q| and 



|b| p dx < o-^B p |Q| . 



For each terminal cube T, we have a function bj supported on T, and satisfying Jbj dx = |T| 
and ||b T || p < B|T| 1/Ap . If the conditions above are met, then we say that the collection, comprised 
of functions b and bj, T G T, is admissible. We will not keep track of the constants cr and B, 
and the implied constants will depend upon them. 

For Q G Q we define the (half) twisted martingale differences 

i (f>Q' _ (f>Q 



D b Q f :-- 



D b f 



A b Q f := 



Q'ech(Q)\T k 

L 

Q'ech(Q)\r 



0V 



Q' 



Q'ech(Q) 



(f) 



Q' 



(^Q')q' 



bn/ 



1q' ) 

(f)Q 
(b) Q 

(b)o 



where we set bqi = b if Q' <£ T and otherwise, bq/ is defined as above. 
This is proved in [2, Lemma 5.3] and [19, Section 2]. 

3.2. Theorem. Suppose thatb andbj, T G T, constitutes an admissible collection. Then, these 
inequalities hold for all selection of constants |eq| < 1. 



^ £Q D b Q f + ^£ Q D b Q f 

QeS q Q6S 

^ £Q A b f <||f|| p , 

QeS P 



< llfl 



q ' 



1 < q < oo , 



where 1 < p < oo is the exponent associated with the admissible function b. 

We will recourse to the following Theorem several times. Aside from Theorem 3.2, it depends 
upon the sparseness of the stopping tree SK 

3.3. Theorem. For each cube Q in R n , and any selection of coefficients |e P | < 1, 



(3.4) 



Pea' : PCQ 



I/Pi 



Pi 



T/?e same statement holds true also with A\, replaced by Ap 
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Before the proof of Theorem, let us make the following instructive remark. 



3.5. Remark. Of particular importance in the sequel will be the following assignments. For a fixed 
So G Si that is not contained in a cube in B\ we set T C TP to be maximal cubes in the collection 

ch 5j (S ) U{T : T c S , T g ch(R), R G £>'} . 

By construction of our perturbed stopping data, it is straight forward to verify that the assignments 
(3 := |3' So and 



|3 T := 



b T T G ch(R) for some R G & 
|3 T otherwise 



yields an admissible collection, with p = Pj and constants cr ~ 5 and B ~ A. Likewise, setting 
b := b' So and bj := by if T G T yields admissible functions. Observe also that P G Q if P G & 
satisfies n S jP = So- Moreover, under the same assumption, Ap = Ap' and Ap = Ap' . Here, 
the right hand sides are defined in (2.8). Observe that the terminal functions |3t and b T for 
T G Tflch(R), R G B\ do not play any role in these last identities. 

Proof of Theorem 3.3. By considering the disjoint collection of those maximal cubes in Q\ that 
are contained in Q, we are reduced to the case of Q G Q\ By Theorem 3.2 and Remark 3.5, we 
first obtain a weaker inequality. Indeed, letting S = 7t 5 jQ, we have 



(3.6) 



PCQ 



Pi 



* HfjlQllPi 



,*IQI 



i/Pi 



We have the last inequality due to the construction of functions fj: — compare to inequalities in 
(2.28) and recall the normalization of f j by A -1 . 

We apply inequality (3.6) recursively for the remaining terms, for which n s \? c Q. Let Tly be 
the maximal R G & strictly contained in Q, and inductively set 7^+1 to be the maximal cubes 
R' G 5' strictly contained in any R G IZ^. By sparsness of <S ] , 



Y_ |R|<T^|R|<---<T k |Q|, 



k > 1 



where < t < 1 . Thus, setting 4> k := Y.Ren X.p • n p=r £p^p'fj. there holds 



SJ 1 



k=1 v i k=1 



Pi 



< 
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00 1 Vp - r 

k=1 k 



£|QI 



i/Pj 



The proof in case of b'-functions is the same. 



□ 



We need a variant of the q-universal inequality for the half-twisted differences to control several 
error terms that arise. For P G Q\ let us define 

(3.7) □? i f j :=|D^'f j |+x P , 
where 

lp a child of P is in <S> 



Xp := 



otherwise 



The functions Dp fj are defined analogously. If the applied function |P or b' is clear from the 
context, we omit the superscripts. The following q-universal inequality is a consequence of 
sparsness of the stopping cubes 5' and the half-twisted inequality, Theorem 3.2. 



(3.8) 



L 1 

Pee' : PCQ 



n p f-l 

Up Tj| 



1/2 



£|QI 



Vq 



1 < q < 00 



Here Q is any cube in R n , and the same inequality is true if we use b'-functions. 
For further details concerning the proof of (3.8), we refer to [19, Section 5]. 

3.2. An Estimate for Perturbations of b. For a later discussion of the diagonal term in §5.1, we 
need novel perturbation inequalities for the twisted martingale differences. We will first formulate 
and prove general statements, and only afterwards specialize to our setting. 

Let So be a dyadic cube, and T be a collection of disjoint dyadic subcubes of So- Let Q be the 
collection of all dyadic subcubes of So which are not contained in any T G T. We suppose there 
are two admissible collections of functions 2 : b and |3, and the corresponding terminal functions 
b T and |3 T for each T G T; for the definitions, we refer to §3.1. Assume further that for all 
Q G Q and T G T there holds, for a fixed < v < 8 _1 , 



(3.9) 



|b-|3| p dx<D p |Q|, 



|b T - |3 T | P < u P |T| • 



These conditions say that b and |3, and the corresponding terminal functions, are 'close'. 



2 With exponent p and constants a, B 
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3.10. Theorem. Suppose inequalities (3.9) hold for some < v < 8 \ and \(f)o\ < A for every 
cube Q G QUT. Then, we have this estimate. 



QeQ 



1/2 



<v{||f-ls ||p + A|S |^}. 



Here the exponent p /s the one associated with functions b and (3, and t/?e implied constant 
depends upon n, p, 0, B. 

This section is devoted to the proof, which is a variant of known techniques [2,19]. The proof 
relies on the crucial martingale transform inequality. Main Lemma follows. 

3.11. Lemma. Suppose that v and f are as in Theorem 3.10. Then, we have the estimate 



(3.12) 



D 5- D &}f| 

QeQ 



iV2 



<v{||f-ls || p + A|S o r}. 



Proof. We begin with preparations. Fix Q 6 Q and Q' e ch(Q) \ T. Set |3 = b — (3, and write 
|3k,Q := ((P)Q/(|3)Q) k . Define P^q/ analogously. Observe the inequalities for every k > 1 . 



(3.13) ||3 k)Q ,| + ||3 k , Q | <2-(4u) k , IPicQ'-Picq|<|0i,q'-Pi,q|-M8v 



ilc-1 



Indeed, these follow from inequalities (3.1) and (3.9), and the fact that Q, Q' G Q. For the latter 
inequality above, one also applies the Mean Value Theorem. 
Then we write 

1 1 If- <P)q 



(3.14) 



(P)q (b) Q (P)qI 1 (P)q + (P)q 
1 



}j-1) k+1 Pk,Q. 



Using the same expansion with Q replaced by Q' yields inequality 



|{D^-D b Q }f|-l Q ,<^ 



k=l 



<f>Q' n <f>Q a 

■Pk,Q' — /Q , Pk,Q 



(P)q' 



(P)q 1 



lo'. 



Then, for a fixed k, write the summand on the right hand side as 

Pk.QDqf • 1q' + |j^y^{Pk,Q' - Pk,Q} ■ 1q' 

< 2(4v) k ■ |D^f I • 1 Q / +4A • k • (8v) k - : ■ |p 1)Q , - p 1)Q | • 1 Q 
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Here we used assumptions and inequalities (3.13). Observe that | |3 1t q/ — P^qI • 1q/ = |Dq(3| • 1q/. 
By summing the series over k, and then summing resulting estimates over Q' e ch(Q) \ T, 

|{DP - D b Q }f | < v\D^f\ + A|D^ p| =v\D* q (f • l So )| + A|D^(P • 1 S J| . 

Inequality (3.12) follows by using (3.9) and universal martingale transform inequalities with q = p, 



Theorem 3.2. 



□ 



Proof of Theorem 3.10. For Q e Q, we write 

A&f - A b Q f 

= {DPf-D b Q f}-b + D^f-(|3-b) 



Q'Gch(Q)nT 



L Q' > 



where we have denoted 



Fq< := (f) Q '-{P Q '-M» 



<f>Q 



1 



{(P)q (b) Q 



b, F 



Q' ■• 



(f>Q 



1 



<0>Q 



Having Lemma 3.11 and martingale difference inequalities, we can proceed as in [19, Section 2]. 
For the convenience of the reader, we briefly recall this argument here. Let 



Sf := 



QeS 



and consider the events E t := {|Sf| > t} C So, where t > 0. It is crucial to realize that we 
can compare Lebesgue measure estimates and estimates with respect to ||3 — b| p dx. Namely, 
by inequality (3.9), the Lebesgue Differentiation Theorem, and the fact that Sf is constant on 
terminal cubes TeT, 



|p-b| p dx<2 n -u p |E t | 



Therefore, by a standard formula and the Lebesgue measure estimates of Theorem 3.2, 



|SfH|3-b| p dx = p 



Et 



||3-b| p dxdt<-u p ||f -ls l 



The square function of [Dqf — Dgf} -b is estimated analogously, using Lemma 3.11, which also 
contributes the constant v. The remaining square functions, associated with Fq/, i = 1,2,3, are 
estimated by using the fact that T is a disjoint collection and Lebesgue measure is doubling. In 
case of i = 2, we also use expansion (3.14) and first inequality in (3.13). □ 
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We specialize the perturbation inequalities to our setting. 



3.15. Theorem. For ) = 1,2 and <v-\ < 4 



-1-n 



we have this estimate. 



(3.16) 



Q£G> 



1/2 



on/Pi 



Pi 



Proof. Let 1Z = A\, and inductively set TZ\c+] to be the maximal cubes S' G $ strictly contained 
in any S G 7^- By sparseness of S\ 

^|S|<T k |Q°|, k>0. 
By disjointness of each collection lZ k , the left hand side of (3.16) is bounded by 

oo r 

L L L |K-^}fi 



7t sj Q=S 



2" 


1/2 


Pi- 


VP 






Pi - 





Fix k > and So = S G lZ k . The basic reduction to a square function involving cubes Q G <2 
and differences Aq and Aq is described in Remark 3.5. We apply Theorem 3.10, and therefore 
we need to verify that its assumptions are satisfied. 

To this end, we can clearly assume that there is a cube Q G such that 7r 5 jQ = So- As a 
consequence, So G «S' is not contained in any cube in £>\ and a case study using definition of 
Type C cubes shows that |(f j ) Q | < 1 if Q G Q U T and ||fj • l So || p . < |S | 1/Pi (recall that fj is 
in dyadic BMO and the normalization by A -1 takes place). Moreover, the same fact about So 
combined with definition of Type A cubes implies that 



|b-|3P 



For the second condition in (3.9) for T G T, we first observe that b T — |3 T = if T is a child of 
a cube in BK In complementary case, T G ch 5 j(So), and its parent is not contained in any cube 
in BK Thus, 



(3t| p 



ipvr <^it 



by definition of Type A cubes. 

The proof is finished by using Theorem 3.10 and appealing to previous inequalities. 



□ 
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3.3. Hardy Inequality. We record a Hardy inequality that we will refer to several times. For a 
proof, we refer to [2, Section 9]. 



3.17. Lemma. Let Q be any cube in R n and k > 1 . For every 1 < p < oo, there holds 



kQ\Q J 



— — dijdx < II^Jpll^Hp/ , 1/p + 1/p=1 



T/?e implied constant depends upon K,p,n. 



4. The Inner Product and the Main Term 

During the course of the remaining sections, we prove inequality (2.13), which then completes 
the proof of our main result, Theorem 1.2. The sum in (2.13) is further split into dual triangular 
sums, one of which is the sum over (P, Q) G Q ] x Q 1 such that IV > £Q. By using goodness, 
this sum is further decomposed into different parts, as below. 

V f3r := {(P, Q) G Q" x Q 1 : 3P n Q = , ZQ < £P} 
diagonal := {(P, Q) G ^ x Q 1 \ V far : 2- r £P < £Q < £P} 

^nearby == {(P, Q) G 6? 1 X Q 1 \ P diagonal I Q C 3P \ P} 

inside := {(P, Q)e& xg 2 \ P diagona , : Q C P} 

The sums over these collections of pairs of cubes are handled separately, and aside from the 
'inside' term, one can sum over the absolute value of the inner products. 

The main tools to control these terms are the twisted martingale transform inequality combined 
with the local Tb hypothesis. Moreover, all cubes are good, a point used systematically, but not 
so noted in the notation. It is frequently combined with the smoothness condition on the kernel, 
to see that certain maximal functions applied to the (3 functions appear. That these maximal 
functions are controlled is a consequence of the selection of stopping cubes, combined with the 
universal half-twisted martingale inequalities. In the analysis of the diagonal term, the perturbation 
inequalities established in §3 play a key role. We recall that the functions fj have been normalized, 
allowing us to assume that A = 1 . 

In this section, we concentrate on the 'inside' term, which is the main term. 

The condition for (P, Q) G inside is that Q C P and 2 T £Q < IP, which are abbreviated to 
Q <s P below. Even though Q is in a different grid, it is good, hence necessarily a child of P 
contains it, and we denote that child by Pq. A book keeping issue motivates the next steps. Set 
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Apfi := Dpfi • p], p, where the half-twisted martingale difference Dp = Dp of (2.9), does not 



sum over of the children of P that have a different stopping parent from that of P. 
It suffices to bound the sum over S G S ] of the terms 



(4.1) 



Y_ (fi)s(T|3 s ,A Q f 2 ) 

Q : QisttS 



+ 



Y_ Y- ( TA p f i,A Q f 2 ) . 

P : n s i P=S Q : Q«P 

The point of this step is that the argument of T depends only on The required bound is 

(4.1) <T loc |S|, f:=T, 0C +viT, 

which is essentially the term that appears in Lemma 2.22, condition (3c), and the implied constant 
is not allowed to depend upon the absorption parameters. Since the collection S ] is sparse, this 
is clearly summable over S G 5 1 to a multiple of T ]oc | Q° | . 

The first term on the right in (4.1) is easy. First of all, by the the local Tb properties stated in 
Lemma 2.22, and the twisted martingale inequality (3.4), 



Y_ (fi)s(T(3 s ,A Q f 2 ) 



Q : QcttS 
QcS 



STWISI 1 ^ 



Y_ ( f i)sA Q f 2 

Q : QisttS 



<T loc |S| 



P2 



The remaining part of the first term is a sum over cubes Q m 7tS for which Q fl S 
is estimated by using Hardy's inequality, Lemma 3.17, and inequality p 2 < pi. 
For the remaining terms, the argument of T is written this way. With Q <e P, 

A P f, = (Dpf 1 ) Pq • pi l s - (D P f 1 )p Q • P s 1 S \p q + A P f, ■ 1p\ Pq 



This part 



=: A{! ara f , - Ap top f , + Af ro % . 

Note that we treat Apf!lp Q as the main contribution, but write lp = 1 
a decomposition of Bs(fi,f 2 ), the second term on display (4.1) without the absolute values, into 
the paraproduct term, the stopping term, and the error term, written as 



ls\p Q ■ This leads to 



gpara, 



'(fi,f2)-Br p (fi,f2)+Br or (fi,f2). 

The terminology is drawn from [22,24]. 

4.1. Control of the Paraproduct Term. This very brief argument is in fact the core of the 
proof. Consider Bg 3 ^^ , f 2 ). In this term, the argument of T is a certain multiple of P s . For 
relevant cubes Q, let us define 

e Q := Y_ (Dpfi)p Q - 

P : tt s1 ?=S 
Q«=P 
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The condition 3 (2.11) of Lemma 2.10, was designed for the implication that the numbers £q are 
uniformly bounded. Therefore, we can write 



Br a (fbf 2 ) 



P : 7t 5 i P=S Q : QeP 



Q : QeS 

< ||l s -T^|| p , Y_ £ Q A Q f 2 

Q:QeS 



< 



T, 0C |S| , 



P2 



where we have appealed to the local Tb hypothesis, the construction of $\, and the martingale 
transform inequality (3.4). 

4.2. The Stopping Term. Recall that 

Bf*(fi,f2)= Y. Y (Dpfi) PQ (T(^l S \P Q ),A Q f 2 ). 

P : 7t 5 , P=S Q : QeP 

We bound this by a constant times |S|, and to do so we will appeal to (a) that Agf 2 has integral 
zero and kernel of T has smoothness, and (b) the universal half-twisted inequality (3.8). This is 
the argument we will use in those instances when the cubes are relatively close. 

For integers s > r, we restrict the side length of Q so that 2 S £Q = IP, and obtain a further 
geometric decay in s. Define 

B s s7(fi,f2):= Y. I- (D P fi)p Q (T(^ls\p p ),A Q f 2 ) 

P : 7t„! P=S Q : QeP 
2 s £Q=fP 

By goodness, there holds dist(S \ Pq, Q) > (£Q) e (£PQ) 1_e . Therefore, by using the smoothness 
condition on the kernel, and the mean zero property of Aqf 2 , we can estimate the inner product 
below as follows, in which xq is the center of Q. 



|(T(^l S \ PQ ),A Q f 2 



Q J 



S\Pq 



{K(x,y)-K(x Q ,y)}^(y)A Q f 2 (x) dydx 



< 



(iQy 



s\ Pq \*-y\ n+r] 



<2- Vs inf MpJ(x) 
xeQ 



||3i(tj)A Q f 2 (x)| dydx 



□qf 2 dx 



3 The condition applies with the minimal cube in t/ 1 , subject to the summation conditions, instead of Q. 
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This is a standard off-diagonal estimate for singular integrals, combined with the goodness of Q 
and the corona construction, and we are also using definition (3.7). Observe that we have already 
gained a geometric decay in s, where r\' = (1 — e) • r\ > 0. 

Since cubes Q with same side length, specified by P, are disjoint, there is a simple appeal to 
Cauchy-Schwarz that we use. Following that, we use the trilinear form of Holder's inequality, with 
indices p! , 2p{, 2pj, and the universal half-twisted inequality (3.8). 



(4.2) iB^f^l-^ Y_ L <l^ f il>P 



P : 7t_i P=S Q : QeP 
2 s tQ=«P 



M|3 S • n Q f 2 dx 



<2^ 



MB 

< 2^' S |S| 1/P 



lV2r 



Y_ dDpf^-lp Y. l°Q f 2 

P : 7t s , P=S Q : Q«=S 

.1 V2 



1 V2 



dx 



Y_ i MD p f i 



P : 7i s , P=S 



Y_ l n Q f 2l 

Q:Q«S 



"I V2 



2P,' 



<2- lls |S|. 

This completes the analysis of stopping term. 
4.3. The Error Term. We control 



B e s rror (fi,f2)= Y_ Y_ L (TtApivlp^AQfa) 



s=r+l P : P=S Q : QgP 
2 S «Q=£P 

With s > r fixed, call the sum above Bf^fi, f 2 ). We obtain a geometric decay in s, by using 
essentially the same argument as just presented. Indeed, 



lOW, •l PV p Q ),A Q f 2 | 



Q J 



P\Pq 



< 



JQ J 

< 2^ 



P\Pq 



{K(x,-y)-K(x Q ,-y)}Apf 1 (y)AQf 2 (x) dijdx 
^-^-|A P f 1 (y)A Q f 2 (x)| dydx 



iDpf^p ■ inf M8 S • 

xGQ 



□Q"f 2 dx . 



The argument of (4.2) then yields |B|7 r (f b f 2 )| < 2^' S \S\. 



5. The Remaining Terms 
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5.1. The Diagonal Term. The diagonal term is the hardest in many arguments, including ours. 
The estimate we will prove is of this form. 



(5.1) 



L 

Peg' Qeg 2 

Qn3P#0,2- r ?P<«Q<«P 



Y_ (TApf i , AQ-f 2 ) < {C T (1 +T, 0C )+n) 1 T}|Q°|. 



Note in particular that the bound in terms of T has leading absorbing constant rv-\. On the 
other hand, T| oc has a leading constant C T that will be exponential in r. The implied constant is 
independent of the absorption parameters. 

The first step in the proof is not so straight forward. Its purpose is to avoid terms {2 cr ViT}|Q°| 
that cannot be absorbed. To explain, let us pass back to the heavier notation Apfi = A P f|. The 
point of the estimate below is that we will replace (3 1 in the twisted differences by b 1 . 



5.2. Lemma. There holds 



Pee 1 Qeg 2 

Qn3P*0,2- T fP<«Q<«P 



(T(A P p f,-A^ fO,A Q f 2 ) 



<{r-U!T}|Q c 



Proof. Perturbation inequality is the principal tool here. By introducing independent Rademacher 
variables {e P } Pgg i that are jointly supported on a probability space O = {-1, ]} g , we have, for 
integers < s < r, 



L L 

Pee' Qeg 2 

Qn3P#0,2- s «P={Q 



(T(A? fi-A? fi),A Q f 2 ) 



a 



^epTfAf'fi-A^fO^eR ^ A Q f 2 \de 



Peg' 



Res' 



< 



a 



T(X.e P {Af , -A? , }f 1 



Pee 1 



Pi 



Qeg 2 
Qn3R5t0,2- s eR=«Q 



de 



pi 



a 



^ ^ e P A Q f 2 

Peg' Qeg 2 

Qn3P5t0,2- s fP=fQ 



de } 
Pi J 



Extract the operator norm from the first factor, and after that apply Khintchine's inequality and 
Theorem 3.15. Theorem 3.3 is used to estimate the second factor, but only after having changed 
the order of summation and having applied the Holder's inequality and inequality ~p\ < p 2 . Finally, 
summing the s-series yields the upper bound tdtTIQ ). □ 
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Assume that 3P n Q * 0, and that 2~ T £P < £Q < IP. We prove 
(5.3) \(JA^i u A Q i 2 )\ £{1 +T loc }-(^ , f 1 ) P (D Q f 2 ) Q |P|. 

Note that we are using Dp' fi on the right hand side. A straight forward appeal to (3.8) completes 
the proof of the diagonal estimate (5.1). 

The cube P has 2 n children P'. If the child P' is not a stopping cube, A^f-\ ■ lp/ is equal to a 
multiple of b\ lp/, where Si is the S ] parent of P, and the multiple is given by the value of the 
half-twisted martingale difference Dp' fi on P'. If P' is also a stopping cube, then Ap' fi ■ lp/ in 
addition involves a bounded multiple of b p ,. In any case, the constant multiples are bounded in 
absolute value by (dp' fi)p, compare to definition (3.7). Similar comments apply to Aqf 2 = Aq f 2 
restricted to a child Q'. By these considerations, it suffices to prove the estimate 

|(Tj;> 2 >|<{1+T loc }|P|, 

where ip 1 = b^lp/, and i]j 2 G {|3| 2 1q/, |3q,1{q/ 6<S 2}}, where S 2 is the S 2 parent of Q. A similar 
estimate is also required when tJj 1 = b P ,, on the condition that P' G 5 1 . The obstruction to these 
estimates is that the stopping conditions control the local norm of Tb^ , but above, we have the 
restriction on bL inside the operator T. 

The case of ij) 1 = b p ,, with P' G <S 1 , is especially easy, since the obstruction just mentioned 
does not arise. By construction of the stopping cubes, and the fact that Lebesgue measure is 
doubling, 

|(Tb p „^ 2 l P /)| < ||l P /Tb P ,|| p /||i|; 2 || P2 < T, oc |P| . 

Above, we have been careful to restrict the inner product to integration over P'. In the comple- 
mentary case, inequality p| < p 2 and the Hardy inequality given in Lemma 3.17 implies 

|(Tb P ,,i|; 2 l Q / X p/)|<|P|. 

In the case of ip 1 = b\ lp/ we must face the obstruction. Write 

x^ 2 = Vip/ +-qj 2 i Q /\ P / . 

The Hardy inequality controls the second term, giving 

KTb^ip,,^ 2 !^,)! < |P|. 

For the first term, return to the local Tb hypothesis, and write 

^ 2 1 P , = (-qj 2 )p,bp, + (iI> 2 l P , - (ijj 2 )p/b 2 ,) =: (Wb 2 , + ^> 2 . 
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The advantage of the first summand on the right is that the local Tb hypothesis gives us 

|(W ' (TbJ, lr>M>)\ = IU>V ■ (b^lp^TVp,)! 

<T loc |(^ 2 ) p; |-|P| 
<T loc |P|. 

The advantage of the second summand is that it has integral zero: J P ,il> 2 dx = 0. Note that 
also ||iF||p 2 < |P| 1/P2 . Take V to be the cubes of the form P'+u, where u £ {-1,0, 1} n -0. 
Then, 

|(Tb^lp,,^ 2 )| < |(Tb^ip 2 )| + |(Tb^l SAP ^ 2 )| 

< |(Tb^,ip 2 )| +^|(Tb^l R ,^ 2 )| + |{Tb^l SA3 p,,^ 2 )| . 
ReV 

The first term is controlled by the stopping rules: 

|<Tb^,xp>|<T loc |P|. 

The second sum is finite, and each summand is precisely of the type that appears in the Hardy 
Inequality. Indeed, although R £ V need not be contained in P, by (2.16) we nevertheless have 



(K\ v ^<(Kn^ <{4 n mfM|b^r} 1/p ' <i. 

From this, it follows that 

Li(^s 1 iR,-$ 2 )i^LKiRik-i p i 1/P2 ^i p i- 

ReV ReV 
By a similar estimate as in (the proof of) Lemma 5.7, and the stopping rules, it follows that 

I0K Isasp'^ 2 )! < mf Mb^, Jlxpl dx < |P| . 

This completes the proof of (5.3), and so the control of the diagonal term. 

5.2. The Nearby Term. The nearby term concerns pairs of cubes (P, Q) £ Pnearby, that is, good 
cubes in g ] xg 2 with the properties 2 T £Q < IV and Q C 3P\P. This bilinear form can be written 
as a sum over S £ 5 1 of the terms 

(5.4) l [S(Al} (f 1 ) s -(T|3^A Q f 2 )+ (TApfi, Aqf 2 ) . 

Q : Qc37tS\7tS P:tt s i P=S Q:Qc3P\P 

2 r iQ<inS " 2HQ<1? 
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The absolute value of the second term is estimated by a sum over integers s > r of the following 
terms that are subject to condition 2 s tQ = IV. By using Lemma 5.5 and then following the 
argument of (4.2), with obvious changes, we get an upper bound for a fixed s > r, 

Y_ Y- l(TA P fi,A Q f 2 )| 

P:tt s , P=S Q:QC3P\P 
2 s fQ=«P 



<2 



•sty 



L L 

P:tt s , P=S Q : Qc3P\P 
2 S £Q=«P 



inf M|3s(x) ■ ((Dpf] |) P 

xeQ 



□ Q f 2 dx < 2- sri '|S| . 



The right hand side is summable in s. The same method of proof controls the first term in (5.4); 
alternatively, one may apply the Hardy's inequality, Lemma 3.17. We omit the details. 

We conclude our analysis of the nearby term by the basic Lemma that relies on goodness of Q. 



5.5. Lemma. Let (P, Q) G Pnearby with n s ^ = S. Then 

|(TApf l5 A Q f 2 )| < (tQ/t?y' ■ inf M|3^(x) ■ (IDpf^p 
Hererj' = ri(1 - e) > 0. 



□ nf2 d.X . 



Proof. By assumption, Q C 3P \ P and 2 r £Q < IV. Since Q is good, |x — xq| < \y — Xq|/2 for 
every x G Q and tj G P. Hence, the kernel smoothness condition applies, and we can estimate as 
follows with Xq the center of Q. 



| (TApf ] , A Q f 2 ) | 



{K(x,-uJ -K{x Q ,y)}A P f } [y)A Q f 2 {x) dydx 



< 



JQ 

< (iQ/i? 



p l*Q-yl n+11 

ti(1-e 



A P f 1 (y)A Q f 2 (x) dydx 

|Aqf 2 | dx. 



inf MApf ! fx 

xGQ 



Since Q G Q 1 , 

|A Q f 2 | dx < n Q f 2 dx. 

Furthermore, by definition, 

|A P fi| = |D P f, • (3^| 



Y_ l(Dpfi) P '| • \&\ ■ ly\ £ {\D?h\)? ■ 
P'ech(P) 
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In particular, MApfi < (|D P fi|)p • M.fi\, so the desired estimate follows. 



□ 



5.3. The Far Term. The far term concerns pairs of cubes (P, Q) G Vf ar , satisfying £Q < £P and 
3P fl Q = 0. The goodness of these cubes is irrelevant here. 

The far term is clearly bounded by the sum over integers s > and t > 1 of 



(5-6) Y. L 



|(TA P fi, A Q f 2 )| . 



P Q : 2 t_1 1 P<dist(P,Q]<2 t f P 

2 s eQ=ep 

Here, the summation is further restricted to pairs (P, Q) 6 Vf ar . By Lemma 5.7, we obtain the 
following upper bound for the term (5.6). 



2-Tis-(n+T))t 



L L 



(□pfl) P ■ In • □ Q f 2 dx 



P Q : 2 t -'fP<dist(P,Q)<2 t «P 
2 s lQ=iP 

1/2 



1 V2 



Pee 1 Qee 2 

< 2- r l(s+t)|Q0| 

Observe that in the first estimate we lose a factor 2 nt/7 twice, because of additional summation 
associated with both of the square functions. In order to see this for the first square function, one 
changes the order of summation and integration, and then applies inequality | "F* | 1 Y_q IQI ^ 2 tn 
for each P inside the P-summation. 

The last bound above is still summable in s and t, so that we are left with proving the following 
basic Lemma. 



5.7. Lemma. Let (P, Q) e V far . Then 

|(TA P f b A Q f 2 )| < (ZQ/ZPf ■ {^^j n " • (DpfOp • 



Q 



□Qj2 



Proof. Since dist(P, Q) > IV, the kernel smoothness condition applies with xq the center of Q. 



| (TApf i , A Q f 2 



< 



{K(x,y)-K(x Q ,y)}Apf 1 (y)AQf 2 (x) dydx 



p I^Q-yl n+n 



A P f 1 (y)A Q f 2 (x) dydx 



< (£QJ • dist(P, Q)- n ■ inf MA P f, (x) 



|Agf 2 | dx 
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Observe that 



p 



Apf 1 1 dx < Dpf i dx 
Jp 



JQ 



|A Q f 2 | dx < DQf 2 dx. 
Jq 



Thus, 




(□ P fi)p. 



The desired estimate follows by combining the estimates above. 



□ 
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